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[i]  The  local  ensemble  transform  (ET)  analysis  perturbation  scheme  is  adapted  to 
generate  perturbations  to  both  atmospheric  variables  and  sea-surface  temperature  (SST). 

The  adapted  local  ET  scheme  is  used  in  conjunction  with  a  prognostic  model  of  SST 
diurnal  variation  and  the  Navy  Operational  Global  Atmospheric  Prediction  System 
(NOGAPS)  global  spectral  model  to  generate  a  medium-range  forecast  ensemble.  When 
compared  to  a  control  ensemble,  the  new  forecast  ensemble  with  SST  variation  exhibits 
notable  differences  in  various  physical  properties  including  the  spatial  patterns  of  surface 
fluxes,  outgoing  longwave  radiation  (OLR),  cloud  radiative  forcing,  near-surface  air 
temperature  and  wind  speed,  and  24-h  accumulated  precipitation.  The  structure  of  the  daily 
cycle  of  precipitation  also  is  substantially  changed,  generally  exhibiting  a  more  realistic 
midday  peak  of  precipitation.  Diagnostics  of  ensemble  performance  indicate  that  the 
inclusion  of  SST  variation  is  very  favorable  to  forecasts  in  the  Tropics.  The  forecast 
ensemble  with  SST  variation  outscores  the  control  ensemble  in  the  Tropics  across  a  broad 
set  of  metrics  and  variables.  The  SST  variation  has  much  less  impact  in  the  Midlatitudes. 

Further  comparison  shows  that  SST  diurnal  variation  and  the  SST  analysis  perturbations 
are  each  individually  beneficial  to  the  forecast  from  an  overall  standpoint.  The  SST 
analysis  perturbations  have  broader  benefit  in  the  Tropics  than  the  SST  diurnal  variation, 
and  inclusion  of  the  SST  analysis  perturbations  together  with  the  SST  diurnal  variation  is 
essential  to  realize  the  greatest  gains  in  forecast  performance. 
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doi:  10.1 029/201 1JD016937. 


1.  Introduction 

[2]  The  uppermost  layers  of  the  ocean  are  of  particular 
interest  for  their  regulation  of  various  atmospheric  pro¬ 
cesses.  Ocean-atmosphere  interaction  occurs  via  the  top  0.1- 
1 .0  mm  of  the  ocean,  referred  to  as  the  “skin  layer”  [Kawai  and 
Wada,  2007].  The  skin  layer  and  the  underlying  one  to  three 
meters  of  the  ocean  exhibit  diurnal  variation  that  is  typically  in 
the  range  0-1  K,  but  that  can  reach  up  to  several  degrees  in 
areas  such  as  the  Pacific  warm  pool  in  conditions  of  low  wind 
speed  and  clear  skies  [e.g.,  Flament  et  al.,  1994;  Ward,  2006; 
Kawai  and  Wada,  2007;  Bellenger  and Duvel,  2009].  There  is 
evidence  that  this  diurnal  variation  is  an  important  determinant 
of  the  surface  latent  and  sensible  heat  fluxes  that  modulate  the 
atmospheric  planetary  boundary  layer  [e.g.,  Clayson  and  Chen, 
2002;  Kawai  and  Wada,  2007].  Also,  the  diurnal  variation 
appears  to  affect  precipitation  rate  [Brunke  et  al.,  2008]  and 
local  atmospheric  circulations  that  are  driven  by  land-sea 
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thermal  contrasts,  such  as  the  sea  breeze  [Kawai  et  al.,  2006], 
Furthermore,  the  diurnal  variation  may  play  an  important  role 
in  the  Madden-Julian  Oscillation  (MJO).  Some  studies  have 
demonstrated  that  numerical  weather  prediction  (NWP)  simu¬ 
lation  of  the  propagation  of  the  MJO  from  the  Indian  Ocean 
region  is  improved  by  inclusion  of  diurnal  variability  in  the 
NWP  model  [Vitart  et  al,  2007;  Woolnough  et  al.,  2007; 
Tokay  a  et  al.,  2010a].  Also,  it  has  been  hypothesized  that 
diurnal  variation  acts  as  a  trigger  mechanism  for  shallow  con¬ 
vection  over  the  Indian  Ocean,  which  proceeds  to  moisten  and 
precondition  the  lower  troposphere  for  the  convectively  active 
phase  of  the  MJO  |  SI  in  go  et  al.,  2003].  Recent  observational 
evidence  seems  to  support  this  hypothesis  [Bellenger  et  al., 
2010].  SST  diurnal  variation  might  help  to  trigger  convection 
over  the  western  Pacific  Ocean  as  well  [Parsons  et  al.,  2000]. 

[3]  The  mounting  evidence  that  the  upper  ocean  diurnal 
cycle  influences  important  weather  phenomena  suggests  the 
importance  of  incorporating  upper  ocean  variability  into 
global  NWP  systems.  Efforts  in  this  regard  could  be 
worthwhile  in  spite  of  the  fact  that  upper  ocean  variability  is 
already  an  inherent  part  of  many  operational  seasonal  and 
climate  prediction  systems  [e.g.,  Arribas  et  al.,  2011],  This 
is  because  these  seasonal  and  climate  prediction  systems  are 
not  optimized  for  weather  prediction.  For  instance,  they 
typically  use  a  daily  or  longer  update  cycle,  as  opposed  to 
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the  standard  six-hourly  update  cycle  of  operational  NWP 
systems.  Also,  they  typically  use  a  ‘Tagged-start”  ensemble 
methodology  that  provides  no  theoretical  constraint  on  the 
sampling  of  initial-state  uncertainty.  To  date,  two  NWP  cen¬ 
ters  have  made  documented  efforts  to  incorporate  upper  ocean 
variability  into  a  global  NWP  system.  The  European  Centre 
for  Medium-Range  Weather  Forecasting  (ECMWF)  coupled 
the  K-Profile  Parameterization  (KPP)  one-dimensional  ocean 
mixed-layer  model  of  Large  et  al.  [1994]  with  the  ECMWF 
Integrated  Forecasting  System  (IFS)  and  tested  the  modified 
IFS  in  conjunction  with  the  ECMWF  51 -member  ensemble 
prediction  system  (EPS)  [Takaya  et  al,  2010a].  Diagnostics 
for  the  medium-range  forecast  interval  (0-16  d  lead  time) 
indicated  that  the  inclusion  of  the  KPP  model  improved  SST 
forecast  skill  at  the  10  d  lead  time  and  beyond,  as  well  as  MJO 
propagation  and  Indian  monsoon  rainfall.  A  brief  comparison 
also  was  made  between  the  diurnal  SST  amplitude  (DSA) 
produced  by  the  KPP  model  and  the  DSA  produced  by  the 
skin-layer  SST  scheme  of  Takaya  et  al.  [2010b],  Notwith¬ 
standing  its  relative  simplicity,  the  skin-layer  scheme  appeared 
to  have  a  slight  advantage  in  simulating  the  DSA.  One  nuance 
of  the  tests  with  the  EPS  is  that  the  ensemble  generation 
neglected  SST  initial-state  uncertainty,  such  that  all  NWP 
integrations  were  begun  from  the  same  prescribed  SST  field. 
Nevertheless,  research  is  ongoing  in  regard  to  using  SST 
analysis  perturbations  generated  by  the  ensembles  of  data 
assimilation  (EDA)  technique  [Buizza  et  al,  2010].  Mean¬ 
while,  the  Met  Office  of  the  United  Kingdom  has  introduced 
random,  spatially  correlated  SST  analysis  perturbations  within 
the  local  ensemble  transform  Kalman  filter  (ETKF)  of  its 
operational  Met  Office  Global  and  Regional  Ensemble  Pre¬ 
diction  System  (MOGREPS).  These  random  SST  analysis 
perturbations  provide  a  beneficial  increase  in  the  ensemble 
spread  of  surface  temperature  and  wind  [ Tennant  and  Beare, 
2011;  Tennant,  2012],  In  addition  to  the  efforts  at  ECMWF 
and  the  Met  Office,  Kunii  and  Miyoshi  [2012]  have  recently 
examined  SST  initial-state  uncertainty  in  an  ensemble  Kalman 
filter  (EKF). 

[4]  The  results  of  Takaya  et  al.  [2010a]  and  Tennant  and 
Beare  [2011]  indicate  that  global  NWP  ensembles  that  take 
into  account  upper  ocean  variability  can  produce  better 
forecasts  than  baseline  ensembles  that  lack  upper  ocean 
variability.  Given  these  promising  results,  the  present  study 
investigates  the  incorporation  of  a  variant  of  the  Takaya 
et  al.  [2010b]  diurnal  SST  scheme  into  the  NWP  compo¬ 
nent  of  the  U.S.  Navy’s  global  EPS.  In  addition,  this  study 
investigates  the  adaptation  of  the  local  ensemble  transform 
(ET)  ensemble  generation  scheme  of  McLay  et  al  [2010]  to 
generate  SST  analysis  perturbations.  Hence,  the  EPS  design 
for  the  present  study  is  distinct  from  those  of  Takaya  et  al. 
[2010a]  and  Tennant  and  Beare  [2011]  in  that  it  takes 
account  of  both  SST  diurnal  variation  and  SST  initial-state 
uncertainty.  The  new  EPS  is  used  to  produce  a  32-member, 
336  h  lead  time  global  forecast  ensemble  and  the  distribu¬ 
tions  of  SST  and  near-surface  atmospheric  variables  from 
this  ensemble  are  compared  with  those  of  a  control  ensemble 
generated  without  SST  analysis  perturbations  and  with  a 
prescribed  SST  field  that  is  fixed  throughout  integration.  The 
importance  of  the  SST  analysis  perturbations  is  also  investi¬ 
gated.  The  paper  is  organized  as  follows.  Section  2  describes 


the  data  and  methodology  including  the  atmospheric  and  SST 
analyses,  the  global  NWP  model,  the  diurnal  SST  scheme, 
the  ET  analysis  perturbations,  and  the  forecast  ensemble 
configurations.  Results  and  conclusions  are  presented  in 
sections  3  and  4,  respectively. 

2.  Data  and  Methodology 

2.1.  Analyses,  Numerical  Model,  and  Verification  Data 

[5]  Global  atmospheric  analyses  were  obtained  from  the 
Naval  Research  Laboratory  (NRL)  Atmospheric  Variational 
Data  Assimilation  System  (NAVDAS),  a  three-dimensional 
variational  assimilation  system  [Dalev  and  Barker,  2001a, 
2001b], 

[6]  SST  analyses  were  obtained  from  a  two-dimensional 
version  of  the  Navy  Coupled  Ocean  Atmosphere  Data 
Assimilation  System  (NCODA)  [Cummings,  2005],  The 
NAVDAS  analyses  were  generated  in  a  6  h  cycle  while 
the  NCODA  analyses  were  generated  in  a  12  h  cycle. 

[7]  Numerical  integrations  were  carried  out  using  the 
Navy  Operational  Global  Atmospheric  Prediction  System 
(NOGAPS),  a  primitive-equation,  frilly  parameterized  spec¬ 
tral  model  [Peng  et  al,  2004].  The  experiments  employ 
NOGAPS  at  horizontal  resolution  T 1 1 9  and  with  30  vertical 
levels,  the  same  resolution  as  used  for  the  Fleet  Numerical 
Meteorology  and  Oceanography  Center  (FNMOC)  opera¬ 
tional  forecast  ensemble  at  the  time  of  the  study.  Integrations 
to  the  336  h  lead  time  were  completed  for  each  00  UTC 
analysis  time  in  the  period  of  1  May  2007  to  30  June  2007. 

[8]  All  forecasts  in  section  3.2.2  were  verified  against  the 
NOGAPS/NCODA  analysis  with  the  exception  of  24-h  accu¬ 
mulated  precipitation,  which  was  verified  against  Tropical 
Rainfall  Measuring  Mission  (TRMM)  satellite-derived  daily 
rainfall  estimates  [Simpson  et  al.,  1988].  The  daily  cycle  of 
precipitation  (section  3.1.2)  was  verified  against  TRMM  three- 
hourly  rain-rate  estimates  (mm  h_ ')  that  were  converted  to  3  h 
accumulated  precipitation  values.  Note  that  the  TRMM 
daily  rainfall  estimates  and  the  TRMM  three-hourly  rain-rate 
estimates  are  two  different  TRMM  products.  The  DSA 
(section  3.1.3)  was  verified  against  observations  of  DSA 
derived  from  Geostationary  Operational  Environmental 
Satellite-East  (GOES-East)  and  GOES-West  raw  satellite 
observations  of  SST. 

2.2.  Prognostic  Model  of  SST 

[9]  Zeng  and  Beljaars  [2005]  use  the  one-dimensional 
heat  transfer  equation  to  derive  a  model  that  predicts  SST 
diurnal  variation  within  two  layers:  a  thin,  cool  skin  layer,  and 
a  3  m-deep  sub-skin  layer.  The  physical  processes  represented 
include  shortwave  and  longwave  radiative  flux,  evaporation, 
molecular  thermal  conduction,  and  wind-driven  turbulent  dif¬ 
fusion  determined  through  Monin-Obhukov  similarity  theory. 
Effects  related  to  wave-breaking,  the  Langmuir  circulation, 
precipitation,  and  buoyancy-driven  mixing  processes  are 
neglected.  Takaya  et  al.  [2010b]  refine  the  Zeng  and  Beljaars 
[2005]  model  by  including  a  different  stability  function  in  the 
definition  of  the  wind-driven  turbulent  diffiision  coefficient, 
and  by  including  mixing  effects  related  to  the  Langmuir  cir¬ 
culation  during  stable  conditions.  The  diurnal  model  of  SST 
that  was  incorporated  into  NOGAPS  for  the  present  study  is 
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that  of  Takaya  et  al.  [2010b],  but  excluding  the  explicit 
Langmuir  circulation  and  the  cool  skin  layer.  The  explicit 
Langmuir  circulation  is  neglected  because  a  wave  model  was 
not  readily  available  for  use.  However,  Langmuir  circulation 
effects  are  implicitly  modeled  by  multiplying  the  friction 
velocity  by  a  factor  of  1 .4  to  obtain  a  slight  enhancement  of 
surface  stress.  This  enhancement  was  validated  against  Global 
Drifter  Program  drifting-buoy  SST  observations  in  the  Tropics 
(not  shown).  The  cool  skin  layer  is  neglected  for  a  combina¬ 
tion  of  two  reasons.  The  first  is  that  the  amplitude  of  nighttime 
cooling  from  the  skin  layer  typically  makes  only  a  small  con¬ 
tribution  to  the  total  diurnal  SST  amplitude  [see  Gentemann 
et  al.,  2003,  Figure  1],  The  second  reason  is  to  facilitate  fore¬ 
cast  verification.  Note,  specifically,  that  forecasts  of  the  skin 
layer  SST  can  only  be  verified  against  certain  satellite  data 
(e.g.,  the  Ocean  Pathfinder  data  of  Gentemann  et  al.  [2003]), 
whereas  forecasts  of  the  sub-skin  layer  SST  can  be  verified 
against  the  NCODA  SST  analyses.  The  NCODA  SST  anal¬ 
yses  represent  the  warm  sub-skin  layer  by  design,  because 
NCODA  assimilates  satellite  radiances  by  regressing  them 
against  drifting-buoy  observations  which  measure  the  sub¬ 
skin  layer  (J.  Cummings,  personal  communication,  2012).  In 
terms  of  the  diurnal  SST  model,  neglect  of  the  cool  skin  layer 
prevents  the  SST  from  assuming  values  that  are  cooler  than  a 
base  SST,  where  the  base  SST  is  defined  to  be  the  analyzed 
SST  at  the  time  of  forecast  initialization.  The  diurnal  warm¬ 
ing  in  the  sub-skin  layer  is  not  affected.  Finally,  note  that  the 
SST  is  updated  at  every  time  step  (specifically,  every  600  s) 
during  the  NOGAPS  integrations. 

2.3.  Generation  of  Analysis  Perturbations 

[to]  The  ensemble  size  was  chosen  to  be  32,  commensu¬ 
rate  with  typical  ensemble  sizes  at  operational  centers.  With 
the  exception  of  the  control  forecast  ensemble,  each  ensem¬ 
ble  member’s  initial  conditions  included  perturbations  to 
SST  and  to  atmospheric  wind,  temperature,  specific  humid¬ 
ity,  and  terrain  pressure.  The  perturbations  to  wind  and 
temperature  were  calculated  for  all  vertical  levels  of  the 
numerical  model,  while  the  perturbations  to  specific  humid¬ 
ity  were  calculated  for  those  vertical  levels  between  the  sur¬ 
face  and  roughly  300  hPa. 

[n]  For  the  control  forecast  ensemble,  each  ensemble 
member’s  SST  field  was  held  constant  throughout  the  fore¬ 
cast  integration  and  was  prescribed  to  be  the  NCODA  SST 
analysis  for  the  given  ensemble’s  initialization  date. 

[12]  Perturbations  to  the  NAVDAS  and  NCODA  analyses 
were  generated  in  a  single,  unified  process  using  a  nine- 
banded  local  formulation  of  the  ensemble  transform  (ET) 
method.  The  fundamental  ET  equation  set  is 

Za  =®ZyT  :  Z^(Va)~IZa  =  Nl,  (1) 


and  ensures  that  the  perturbations  are  quasi-orthogonal.  For 
all  experiments  presented  in  this  paper,  Pa  is  diagonal  and  its 
respective  atmospheric  and  SST  elements  are  obtained 
directly  from  NAVDAS  and  NCODA.  Both  NAVDAS  and 
NCODA  estimate  the  elements  of  P„  using  a  method  based 
on  the  block  diagonal  preconditioner  of  the  conjugate  gra¬ 
dient  minimization  algorithm  and  Choleski  decomposition. 
The  method  is  detailed  in  Daley  and  Barker  [2001a]  and  briefly 
summarized  in  Daley  and  Barker  [2001b]  and  Cummings  and 
Smedstad  [2012],  In  general,  Pa  is  not  diagonal,  but  the  oper¬ 
ational  implementations  of  the  ET  at  the  National  Center  for 
Environmental  Prediction  (NCEP)  and  at  the  Fleet  Numerical 
Meteorology  and  Oceanography  Center  (FNMOC)  both 
employ  diagonal  P„  to  reduce  the  computational  expense  of 
forecast-ensemble  generation. 

[13]  McLay  et  al.  [2010]  realize  improved  performance 
from  the  ET  through  a  nine-banded  local  formulation.  In  this 
formulation,  the  global  NWP  domain  is  partitioned  into  nine 
non-overlapping  latitude  bands  and  perturbation  generation 
proceeds  in  three  steps. 

[14]  1.  Nine  different  [K  x  K\  weighting  matrices 
T ){j=  1,..,  9)  are  obtained,  one  for  each  of  the  nine  latitude 
bands.  Define  iy,  ZJj,  and  PJa  to  be  the  number  of  elements  of 
the  state  vector  within  band  j,  the  [iy  x  K]  forecast  pertur¬ 
bation  matrix  for  band  j,  and  the  \ty  x  iy\  diagonal  analysis- 
error  covariance  matrix  for  band  j,  respectively.  Then  T,-  is 
obtained  by  solving  the  standard  ET  eigen-equation 
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where  T)  =  CA  ^C7  (note  that  solution  of  this  eigen- 
equation  requires  special  treatment  of  a  zero-valued  eigen¬ 
value,  as  detailed  in  McLay  et  al.  [2008]).  Once  the  Ty-  are 
obtained,  they  are  “centered”  in  their  respective  latitude 
bands.  The  centers  of  the  T ]-  are  analogous  to  the  “localization 
centres”  of  Bowler  et  al.  [2009]. 

[15]  2.  Let  M  equal  the  number  of  Gaussian  latitudes  of  the 
NWP  model.  Then  for  each  Gaussian  latitude  m(m  =  1,..,  M), 
a  [K  x  K\  weighting  matrix  Toaussian  is  obtained  through 
linear  interpolation  of  the  centered  weighting  matrices 
T;C/=1,..,9). 

[16]  3 .  Define  Nm  to  be  the  number  of  elements  of  the  state 
vector  within  the  vertical  slice  of  domain  through  Gaussian 
latitude  m.  Then  for  each  Gaussian  latitude  m(m  =  1,..,  M), 
the  ensemble  transform  equation 

rjm  'yrtvyrn  /i\ 

1  Gaussian  W/ 


is  solved,  where  Zj"(Z”!)  is  the  [Nm  x  K]  forecast  (analysis) 
perturbation  matrix  for  Gaussian  latitude  m. 


where  N  is  the  size  of  the  NWP  model  state  vector  (here 
including  both  atmospheric  and  SST  elements),  K  is  the 
ensemble  size,  Zy  (Za)  is  an  N  x  K  matrix  of  forecast 
(analysis)  perturbations,  T  is  a  K  x  K  matrix  of  weighting 
coefficients,  Pa  is  a  N  x  N  matrix  of  analysis  error  covari¬ 
ance  estimates,  and  I  is  the  K  x  K  identity  matrix  [McLay 
et  al.,  2008],  The  constraint  Zj(Pa)_1Za  =  M  imparts  a 
magnitude  to  the  perturbations  that  is  consistent  in  a  domain- 
average  sense  with  estimates  of  analysis  error  covariance 


2.4.  Forecast-Ensemble  Configurations 

[17]  Four  different  forecast  ensembles  were  generated  for 
this  study. 

[is]  1.  Control.  An  ensemble  generated  with  no  SST 
variation  during  the  numerical  integration.  Each  ensemble 
member’s  SST  field  is  held  constant  throughout  the  inte¬ 
gration  and  is  prescribed  to  be  the  NCODA  SST  analysis  for 
the  given  ensemble’s  initialization  date. 
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[19]  2.  SSTidf.  An  ensemble  generated  with  both  diurnal 
forcing  of  SST  and  SST  analysis  perturbations  (idf  stands  for 
initial-perturbations  and  diurnal  forcing). 

[20]  3.  SSTdf:  An  ensemble  generated  with  diurnal  forcing 
of  SST  but  no  SST  analysis  perturbations.  At  analysis  time,  the 
SST  field  of  each  ensemble  member  is  identical  to  that  of  the 
NCODA  SST  analysis  for  the  given  ensemble’s  initialization 
date.  Note  that  although  there  are  no  SST  analysis  perturba¬ 
tions  the  diurnal  SST  forcing  is  still  capable  of  affecting  the 
atmospheric  analysis  perturbations  by  virtue  of  its  interaction 
with  the  atmospheric  forecast  perturbations  during  the  6  h 
cycling  interval. 

[21]  4.  SSTi:  An  ensemble  generated  with  SST  analysis 
perturbations  but  no  diurnal  forcing  of  SST.  The  analysis 
perturbations  are  taken  to  be  those  of  forecast  ensemble 
SSTidf,  to  ensure  that  the  perturbations  are  balanced.  Rec¬ 
ognize  that  each  ensemble  member’s  SST  field  is  different 
from  the  other  members’,  but  the  SST  field  is  held  constant 
throughout  the  integration. 

3.  Results 

3.1.  Diagnostics  of  Physical  Properties 

[22]  Forecast-ensemble  SSTidf  was  compared  to  the  con¬ 
trol  ensemble  in  terms  of  various  physical  variables  including 
24-h  accumulated  precipitation,  surface  latent-  and  sensible- 
heat  flux,  outgoing  longwave  radiation  (OLR),  cloud  radia¬ 
tive  forcing,  2-m  air  temperature,  and  10-m  wind  speed  to 
highlight  differences  to  these  variables  that  are  brought  about 
by  inclusion  of  the  diurnal  SST  model  and  the  SST  analysis 
perturbations.  Also,  ensemble  SSTidf  was  compared  to 
observations  in  terms  of  its  daily  cycle  of  precipitation  in  the 
Tropics  and  its  DSA  conditioned  on  wind  speed. 

3.1.1.  Differences  in  Physical  Variables 

[23]  All  differences  described  here  are  average  differences 
over  the  test  period.  Also,  the  statistical  significance  of  the 
average  difference  at  each  grid  point  was  tested  using  a  method 
that  accounts  for  serial  dependence  of  data  [Wilks,  2006,  pp. 
143-145]  and  only  differences  that  meet  a  95%  significance 
threshold  are  shown.  Additionally,  only  over-ocean  differences 
are  discussed,  except  in  the  case  of  24-h  accumulated  precipi¬ 
tation.  For  interpretation  purposes,  note  that  the  T  +  240  h 
(T  +  252  h)  forecast  is  valid  at  00  UTC  (12  UTC). 

[24]  1.  Surface  latent-heat  flux :  Figure  la  (Figure  lb) 
shows  the  difference  between  the  ensemble-mean  surface 
latent-heat  flux  at  T  +  240  h  (T  +  252  h)  in  SSTidf  and  that  in 
the  control.  One  can  infer  that  during  daylight  hours  (e.g., 
over  the  Pacific  Ocean  at  T  +  240  h)  the  latent-heat  flux  in 
SSTidf  is  greater  over  the  ocean  in  both  the  Tropics  and  the 
NFI  Midlatitudes,  as  might  be  expected.  During  nighttime 
hours  (e.g.,  over  the  Atlantic  Ocean  at  T  +  240  h),  the  latent- 
heat  flux  in  SSTidf  tends  to  be  smaller  over  the  ocean  in  the 
Tropics,  but  not  generally  in  the  NFI  Midlatitudes.  The 
latent-heat  flux  in  SSTidf  is  greater  in  the  equatorial  and 
southern  Indian  Ocean  at  both  day  and  night. 


[25]  2.  Surface  sensible-heat  flux'.  Figure  lc  (Figure  Id) 
shows  the  difference  between  the  ensemble-mean  surface 
sensible-heat  flux  at  T  +  240  h  (T  +  252  h)  in  SSTidf  and 
that  in  the  control.  As  with  the  latent-heat  flux,  the  sensible- 
heat  flux  in  SSTidf  during  daylight  hours  is  typically  greater 
over  the  ocean  in  both  the  Tropics  and  the  NH  Midlatitudes. 
At  night,  the  sensible-heat  flux  in  SSTidf  is  smaller  over  the 
ocean  in  the  Tropics,  the  NH  Subtropics,  and  the  higher- 
latitude  North  Pacific  Ocean,  the  sensible-heat  flux  is  not 
significantly  different  over  most  of  the  eastern  Indian  Ocean 
at  either  day  or  night. 

[26]  3.  OLR\  Figure  2a  (Figure  2b)  shows  the  difference 
between  the  ensemble-mean  OLR  at  T  +  240  h  (T  +  252  h) 
in  SSTidf  and  that  in  the  control.  At  T  +  240  h,  the  OLR  in 
SSTidf  is  substantially  smaller  (greater)  to  the  immediate 
north  (south)  of  the  Equator  throughout  the  Indian  Ocean 
and  the  western  and  central  Pacific  Ocean.  In  the  eastern 
Pacific  Ocean,  the  OLR  is  greater  over  the  Equator  and 
smaller  in  the  Subtropics.  Meanwhile,  the  OLR  is  smaller 
over  the  tropical  Atlantic  Ocean  and  greater  over  the  Bay  of 
Bengal.  The  same  general  patterns  are  seen  at  T  +  252  h.  The 
roughly  cross-equatorial  change  in  OLR  seen  in  the  Indian 
and  western  Pacific  Oceans  suggests  that  in  SSTidf  there  is  a 
strengthened  Inter-Tropical  Convergence  Zone  (ITCZ)  in 
these  regions,  with  deeper  convection  (i.e.,  colder  convec¬ 
tive  cloud  tops)  to  the  north  of  the  Equator  where  the  ITCZ 
is  located  in  NH  summer. 

[27]  4.  Cloud  radiative  forcing'.  Figure  2c  (Figure  2d) 
shows  the  difference  between  the  ensemble-mean  shortwave 
(longwave)  cloud  radiative  forcing  [SCRF  (LCRF)]  in 
SSTidf  and  that  in  the  control.  The  cloud  radiative  forcing  is 
defined  to  be  the  difference  between  the  total-sky  and  the 
clear-sky  net  radiative  flux  at  the  surface.  The  difference  in 
SCRF  generally  bears  a  direct  relationship  to  the  difference 
in  OLR  (Figure  2a),  with  notable  exceptions  in  the  equato¬ 
rial  central  and  eastern  Pacific  Ocean.  The  difference  in 
LCRF  is  roughly  anti-correlated  to  the  difference  in  SCRF, 
although  both  the  extent  and  magnitude  of  the  significant 
differences  in  LCRF  are  smaller  (Figure  2d).  From  a  global- 
mean  perspective,  the  ensemble-mean  SCRF  in  the  control 
(SSTidf)  is  —43.1  (—43.0)  WrcT2,  and  the  ensemble-mean 
LCRF  in  the  control  (SSTidf)  is  +34.1  (+34.0)  Wnf2  Hence, 
the  SST  variation  induces  a  slight  reduction  in  global-mean 
magnitude  of  both  the  SCRF  and  LCRF.  This  reduction  is 
statistically  significant. 

[28]  5.  Accumulated  precipitation  over  a  24-h  internal: 
Figure  3  shows  the  difference  between  the  ensemble-mean 
precipitation  accumulated  over  the  24-h  forecast  interval 
ending  at  T  +  240  h  in  the  SSTidf  ensemble  and  that  in  the 
control.  Ensemble  SSTidf  produces  more  precipitation  over 
the  ocean  in  certain  parts  of  the  Tropics,  including  the  West 
Pacific  warm  pool  region,  the  western  Indian  Ocean,  and  the 
eastern  Pacific  Ocean  near  Central  America.  Conversely, 
SSTidf  produces  less  precipitation  over  some  land  masses, 
including  the  Maritime  Continent  and  Central  America. 


Figure  1.  (a)  Average  difference  between  the  SSTidf  ensemble  and  the  control  ensemble  in  terms  of  the  ensemble-mean 
surface  latent-heat  flux  (WnO2)  at  T  +  240  h.  Colored  shading  indicates  statistically  significant  average  differences, 
(b)  As  for  Figure  la  but  at  T  +  252  h.  (c)  Average  difference  between  the  SSTidf  ensemble  and  the  control  ensemble  in  terms 
of  the  ensemble-mean  surface  sensible-heat  flux  (Wm~2)  at  T  +  240  h.  (d)  As  for  Figure  lc  but  at  T  +  252  h. 
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Figure  3.  Average  difference  between  the  SSTidf  ensemble  and  the  control  ensemble  in  terms  of  the 
ensemble-mean  precipitation  (mm)  accumulated  over  the  24-h  forecast  interval  ending  at  T  +  240  h. 
Colored  shading  indicates  statistically  significant  average  differences. 


[29]  6.  Air  temperature  at  2  nr.  Figure  4a  (Figure  4b) 
shows  the  difference  between  the  ensemble-mean  2-m  air 
temperature  at  T  +  240  h  (T  +  252  h)  in  SSTidf  and  that  in 
the  control.  Figure  4a  shows  that  at  T  +  240  h  the  2-m  air 
temperature  in  SSTidf  is  greater  over  a  number  of  regions, 
most  notably  the  subtropical  eastern  Pacific  Ocean,  but  also 
the  midlatitude  eastern  Pacific  Ocean,  the  Gulf  of  Mexico, 
the  western  Atlantic  Ocean,  the  Indian  Ocean,  and  the  West 
Pacific  warm  pool.  Figure  4b  shows  that  at  T  +  252  h  the  2-m 
air  temperature  in  SSTidf  is  substantially  greater  over  the 
West  Pacific  warn  pool,  and  also  is  greater  over  most  of  the 
Indian  Ocean  north  of  the  Equator.  These  difference  patterns 
suggest  that  the  diurnal  warming  of  SST  enhances  the  warm 
part  of  the  daily  cycle  of  2-m  air  temperature,  as  would  be 
expected. 

[30]  7.  Wind  speed  at  10  m:  Figure  4c  (Figure  4d)  shows 
the  difference  between  the  ensemble-mean  10-m  wind  speed 
at  T  +  240  h  (T  +  252  h)  in  SSTidf  and  that  in  the  control.  At 
both  T  +  240  h  and  T  +  252  h,  the  10-m  wind  speed  in 


SSTidf  is  substantially  higher  over  the  Indian  Ocean  and 
West  Pacific  warm  pool,  with  the  maximum  differ¬ 
ence  =0.5  ms~ 1 .  In  a  small  number  of  places  the  wind  speed 
is  lower,  notably  the  Bay  of  Bengal,  the  eastern  Pacific  south 
of  Panama,  and  in  the  vicinity  of  the  Gulf  of  Mexico. 

[31]  The  differences  noted  above  in  the  physical  variables’ 
ensemble-mean  properties  might  arise  for  two  reasons.  One 
is  the  fact  that  a  non-zero  mean  can  emerge  within  the  set  of 
ensemble  perturbations  as  the  perturbations  are  integrated, 
even  though  the  mean  of  the  perturbations  is  zero  by  design 
at  the  outset  of  integration.  The  emergence  of  a  non-zero 
mean  is  a  consequence  of  integration  with  a  nonlinear 
model,  and  can  occur  for  both  the  atmospheric  perturbations 
and  the  SST  perturbations.  The  other  possible  reason  for  the 
differences  in  the  ensemble-mean  properties  is  the  configu¬ 
ration  of  the  model  of  SST  diurnal  variation.  Recall  that  this 
model  superposes  the  diurnal  variation  on  a  base  tempera¬ 
ture  which  in  the  present  study  is  taken  to  be  the  SST  anal¬ 
ysis  for  the  initialization  date  of  the  forecast.  Also,  recall  that 


Figure  2.  (a)  Average  difference  between  the  SSTidf  ensemble  and  the  control  ensemble  in  terms  of  the  ensemble-mean 
OLR  (Wm-2)  at  T  +  240  h.  Colored  shading  indicates  statistically  significant  average  differences,  (b)  As  for  Figure  2a 
but  at  T  +  252  h.  (c)  Average  difference  between  the  SSTidf  ensemble  and  the  control  ensemble  in  terms  of  the  ensemble- 
mean  SCRF  (Wm'  )  averaged  over  the  336  h  forecast  interval.  Colored  shading  indicates  statistically  significant  average 
differences,  (d)  As  for  Figure  2c  but  for  LCRF. 
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the  SST  cannot  cool  below  the  base  temperature.  It  follows 
that  a  forecast  with  SST  diurnal  variation  will  have  a  slightly 
warmer  SST,  from  a  spatially  and  temporally  averaged  per¬ 
spective,  than  that  same  forecast  would  if  only  persisted  SST 
were  used.  Thus,  inclusion  of  the  SST  diurnal  variation 
enhances  not  just  the  variability  but  also  the  mean  of  the 
NWP  model’s  bottom  boundary  thermal  forcing.  Quantita¬ 
tively,  the  SST  diurnal  variation  causes  the  daily  average 
SST  to  increase  by  an  amount  typically  between  0.1 0°C  and 
0.25°C  in  those  tropical  regions  with  the  greatest  DSA,  with 
the  largest  increases  being  observed  in  the  South  China  Sea 
region.  The  daily  average  SST  in  the  midlatitude  North 
Pacific  and  North  Atlantic  Oceans  increases  by  an  amount 
typically  between  0.05°C  and  0.15°C.  To  investigate  if 
either  the  nonlinear  evolution  of  the  ensemble  perturbations 
or  the  SST  diurnal  variation  is  the  dominant  source  of  the 
differences  noted  above,  comparisons  were  made  between 
the  control  and  SSTi  ensembles  and  between  the  control  and 
SSTdf  ensembles  in  terms  of  ensemble-mean  surface  latent 
and  sensible  heat  flux  (not  shown).  In  general,  the  SSTi 
and  SSTdf  ensembles  exhibited  quantitatively  comparable 
differences  relative  to  the  control.  This  suggests  that  both 
the  nonlinear  evolution  of  the  ensemble  perturbations  and  the 
SST  diurnal  variation  make  important  contributions  to  the 
differences. 

3.1.2.  Diurnal  Cycle  of  Precipitation  in  the  Tropics 

[32]  As  shown  above,  the  inclusion  of  diurnal  SST  varia¬ 
tion  and  SST  analysis  perturbations  discemably  changes  the 
forecasts  of  variables  such  as  latent-heat  flux  that  play  a  role 
in  governing  convective  processes.  Given  these  changes,  it  is 
possible  that  the  timing  and  structure  of  the  forecast  daily 
cycle  of  precipitation  are  correspondingly  altered.  To  inves¬ 
tigate  this  possibility,  here  a  composite  daily  cycle  of  pre¬ 
cipitation  conditional  upon  wind  speed  regime  is  constructed 
for  forecast  SSTidf  and  compared  to  a  corresponding  com¬ 
posite  for  the  control  forecast  and  to  a  composite  of  the 
observed  daily  cycle  as  derived  from  TRMM  satellite  data 
(described  above  in  section  2.1). 

[33]  The  calculation  of  the  composite  forecast  daily  cycle 
of  precipitation  proceeds  as  follows.  First,  a  longitude  band 
of  10°  width  is  chosen  (e.g.,  0°E  to  10°E,  10°E  to  20°E,  .., 
350°E  to  360°E).  Then,  the  compositing  region  is  defined  as 
the  intersection  of  this  longitude  band  with  the  20°  S  to  20°  N 
tropical  latitude  band.  The  use  of  10°  longitude  bands 
ensures  that  all  grid  points  within  the  compositing  region  are 
less  than  1  h  apart  in  local  time.  Next,  a  wind  speed  regime,  a 
24-h  forecast  interval  (e.g.,  T  +  240  h  to  T  +  264  h),  and  a 
particular  grid  point  (on  a  1°  x  1°  latitude-longitude  grid) 
within  the  compositing  region  are  chosen.  Then,  for  each 
00  UTC  forecast  initialization  in  the  test  period  and  each 
ensemble  member,  a  check  is  made  to  ensure  that  the 
ensemble  member’s  10-m  wind  speed  forecast  at  the  chosen 
grid  point  meets  the  specified  wind  speed  criterion  (e.g., 


0-3  ms-1)  at  every  three-hourly  point  of  the  forecast  interval 
(e.g.,  T  +  240  h,  T  +  243  h,  T  +  246  h,  ..).  If  the  wind  speed 
meets  the  criterion,  then  the  ensemble  member’s  sequence  of 
three-hourly  accumulated  precipitation  for  the  24-h  forecast 
interval  is  added  to  the  composite.  When  the  sequence  of 
precipitation  is  added  to  the  composite,  the  sequence  is 
normalized  by  its  maximum  value.  This  normalization 
ensures  that  sequences  with  light  precipitation  get  the  same 
weight  in  the  composite  as  sequences  with  heavy  precipita¬ 
tion.  The  preceding  process  is  repeated  for  all  00  UTC 
forecast  initializations,  all  ensemble  members,  and  all  grid 
points  within  the  compositing  region.  Once  the  composite 
sequence  is  obtained,  it  is  normalized  by  its  maximum  value. 
This  final  normalization  removes  differences  in  amplitude 
between  one  composite  sequence  and  another  that  arise 
because  of  the  different  number  of  sequences  that  comprise 
each  composite. 

[34]  Calculation  of  the  composite  observed  daily  cycle  of 
precipitation  proceeds  in  much  the  same  way  as  for  the 
composite  forecast  daily  cycle.  First,  the  TRMM  data  are 
interpolated  to  the  same  1°  x  1°  latitude-longitude  grid  as  the 
forecast  data.  Then,  a  wind  speed  regime,  a  date,  and  a  par¬ 
ticular  grid  point  within  the  compositing  region  are  chosen.  A 
check  is  made  to  ensure  that  the  analyzed  1 0-m  wind  speed  at 
the  grid  point  meets  the  specified  wind  speed  criterion  at  all 
four  daily  analysis  times  00,  06,  12,  and  18  UTC.  If  the  wind 
speed  meets  the  criterion,  then  the  TRMM  sequence  of  three- 
hourly  accumulated  precipitation  for  the  date  is  added  to  the 
composite.  The  TRMM  sequences  used  for  the  composite  are 
normalized  in  the  same  fashion  as  the  sequences  used  for  the 
composite  forecast  daily  cycle. 

[35]  Figure  5  shows  the  composite  daily  cycle  of  precipi¬ 
tation  for  the  SSTidf  ensemble,  the  control  ensemble,  and 
the  TRMM  precipitation  observations  for  six  different 
compositing  regions.  The  composite  forecast  daily  cycles  are 
for  the  interval  from  T  +  120  h  to  T  +  144  h.  Also,  all 
composites  are  for  the  low-wind  speed  (0-3  ms-1)  regime, 
since  one  would  expect  the  largest  signal  from  the  diurnal 
SST  forcing  in  this  regime.  Considering  the  western  and 
eastern  Indian  Ocean  compositing  regions  (Figures  5a  and 
5b),  one  sees  a  prominent  midday  precipitation  peak  in  the 
TRMM  observations.  There  is  little  to  no  indication  of  this 
midday  peak  in  the  control  ensemble,  whereas  there  is  a 
pronounced  midday  peak  in  the  SSTidf  ensemble,  albeit 
three  hours  too  late  in  both  regions.  All  three  composite 
cycles  also  show  an  early  morning  precipitation  peak. 
However,  in  the  case  of  the  SSTidf  ensemble,  this  peak  is 
somewhat  diminished  in  relative  prominence  as  compared  to 
the  midday  peak.  The  timing  of  the  early  morning  peak  for 
the  western  Indian  Ocean  (Figure  5a)  is  three  hours  too  late 
in  both  the  control  and  the  SSTidf  ensembles.  For  the  South 
China  Sea  compositing  region  (Figure  5c),  the  representation 
of  the  midday  precipitation  peak  in  the  SSTidf  ensemble  is 


Figure  4.  (a)  Average  difference  between  the  SSTidf  ensemble  and  the  control  ensemble  in  terms  of  the  ensemble-mean  2-m 
air  temperature  (°C)  at  T  +  240  h.  Colored  shading  indicates  statistically  significant  average  differences,  (b)  As  for  Figure  4a 
but  at  T  +  252  h.  (c)  Average  difference  between  the  SSTidf  ensemble  and  the  control  ensemble  in  terms  of  the  ensemble- 
mean  10-m  wind  speed  (ms^1)  at  T  +  240  h.  (d)  As  for  Figure  4c  but  at  T  +  252  h. 
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not  much  different  than  that  in  the  control.  However,  the 
relative  prominence  of  the  early  morning  peak  is  again 
diminished  in  the  SSTidf  ensemble.  The  timing  of  the  early 
morning  peak  is  three  hours  too  late  in  both  the  control  and 
SSTidf  ensembles.  Considering  the  western  Pacific  Ocean 
compositing  region  (Figure  5d),  one  sees  again  that  the 
control  ensemble  fails  to  represent  the  midday  precipitation 
peak  whereas  the  SSTidf  ensemble  is  able  to  capture  it.  The 
early  morning  peak  is  slightly  diminished  in  prominence  in 
the  SSTidf  ensemble,  but  to  much  less  of  an  extent  than  for 
the  Indian  Ocean  or  South  China  Sea  regions.  The  timing  of 
the  early  morning  peak  is  unchanged  in  the  SSTidf  ensemble 
as  compared  to  the  control.  For  the  eastern  Pacific  Ocean 
compositing  region  (Figure  5e),  the  afternoon  precipitation 
peak  is  more  prominent  in  the  SSTidf  ensemble  as  compared 
to  the  control  while,  conversely,  the  early  morning  peak  in 
the  SSTidf  ensemble  is  less  pronounced.  Thus,  the  SSTidf 
ensemble  better  represents  both  the  afternoon  and  early 
morning  peaks  of  the  daily  cycle,  unlike  the  cases  for  the 
Indian  Ocean  and  western  Pacific  Ocean  compositing  regions. 
Nonetheless,  the  timing  of  the  early  morning  peak  remains 
three  hours  too  late  in  the  SSTidf  ensemble.  Last,  considering 
the  Atlantic  Ocean  compositing  region  (Figure  5f ),  one  finds 
not  much  difference  between  the  daily  cycles  of  the  SSTidf 
and  control  ensembles.  The  timing  of  the  early  morning  peak 
is  six  hours  too  late  in  both  ensembles,  and  both  ensembles 
fail  to  capture  the  early  afternoon  and  late  evening  peaks. 
Even  so,  the  SSTidf  ensemble  is  arguably  more  realistic  than 
the  control  because  it  suggests  the  occurrence  of  more  pre¬ 
cipitation  during  the  afternoon  and  evening  timeframe.  The 
improvement  noted  above  in  the  midday  precipitation  peak  is 
consistent  with  the  notion  that  the  SST  diurnal  variation  with 
its  attendant  higher  peak  daily  SST  and  higher  latent  and 
sensible  heat  fluxes  enhances  the  NWP  model’s  ability  to 
trigger  atmospheric  convection.  On  the  other  hand,  the  SST 
diurnal  variation's  suppression  of  the  early  morning  precipi¬ 
tation  peak  in  certain  regions  such  as  the  South  China  Sea  is 
not  so  readily  interpreted,  and  will  require  further  work  to 
understand. 

[36]  Composite  daily  cycles  of  precipitation  also  were 
separately  produced  for  those  cases  where  the  daily  accu¬ 
mulated  precipitation  amount  was  below  median,  above 
median,  below  third  quartile,  and  above  third  quartile  (not 
shown).  The  median  and  third  quartile  were  calculated  for 
each  respective  data  set  and  region,  e.g.,  regionally  varying 
medians  and  third  quartiles  were  calculated  specific  to  each 
of  the  control,  SSTidf,  and  TRMM  data  sets.  All  of  these 
composites  yield  the  same  general  results  as  above,  with 
minor  case  to  case  variation.  Additionally,  the  composite 
daily  cycle  of  precipitation  for  each  of  these  cases  was  rep¬ 
licated  without  the  normalization  of  each  sequence  in  the 


composite  by  its  maximum  value.  These  composites  again 
yield  the  same  general  results  as  above. 

3.1.3.  Diurnal  SST  Amplitude  Conditioned 
on  Wind  Speed 

[37]  The  diurnal  cycle  that  is  observed  in  skin-layer  SST  is 
a  sensitive  function  of  wind  speed,  since  strong  winds  will 
induce  mixing  that  acts  to  reduce  thermal  stratification  in  the 
upper  ocean  [Gentemann  et  al.,  2003;  Kawai  and  Wada, 
2007].  Existing  literature  suggests  that  SST  forecasts  tend 
to  have  some  difficulty  in  representing  this  function  [e.g., 
Takaya  et  al.,  2010a].  Thus,  here  composites  of  the  DSA  for 
ensemble  SSTidf  are  constructed  both  with  and  without 
conditioning  upon  wind  speed.  The  resulting  composites  are 
then  compared  to  composite  observations  of  DSA  derived 
from  raw  satellite  data.  For  the  conditioning  upon  wind 
speed,  three  different  regimes  of  daily  average  wind  speed 
are  defined:  Low  wind  speed  (0-3  ms-1),  intermediate  wind 
speed  (3-5  ms-1)  ,  and  high  wind  speed  (>5  ms-1).  The 
DSA  for  a  given  location  and  24-h  interval  is  calculated 
as  the  maximum  value  of  SST  at  that  location  within  the 
24-h  interval  minus  the  minimum  value  of  SST  at  that 
location  within  the  24-h  interval. 

[38]  The  calculation  of  composite  forecast  DSA  proceeds 
as  follows.  A  wind  speed  regime,  a  24-h  forecast  interval 
(e.g.,  T  +  120  h  to  T  +  144  h),  and  a  particular  grid  point  on 
a  1°  x  1°  latitude-longitude  grid  are  chosen.  Then,  for  each 
00  UTC  forecast  initialization  in  the  test  period  and  each 
ensemble  member,  a  check  is  made  to  ensure  that  the 
average  over  the  forecast  interval  of  the  ensemble  member’s 
10-m  wind  speed  meets  the  specified  wind  speed  criterion 
(0-3  ms-1  or  >5  ms-1).  If  the  wind  speed  meets  the  criterion, 
the  DSA  is  calculated  and  collected.  This  process  is  repeated 
for  all  00  UTC  forecast  initializations  and  all  ensemble 
members,  and  the  resulting  collection  of  DSA  values  are 
averaged  to  produce  the  composite  at  the  chosen  grid  point. 

[39]  Figure  6a  shows  the  unconditioned  composite  fore¬ 
cast  DSA  for  ensemble  SSTidf  for  the  interval  from 
T  +  120  h  to  T  +  144  h.  The  unconditioned  forecast  DSA 
exhibits  a  maximum  in  the  eastern  Indian  Ocean  that  is  on 
average  1.2-1.4°C.  Slightly  weaker  maxima  of  =T°C  are 
seen  in  the  West  Pacific  warm  pool  and  along  the  west  coast 
of  Mexico.  Unconditioned  forecast  DSA  greater  than  0.2°C 
is  common  in  many  other  parts  of  the  NH. 

[40]  The  forecast  DSA  conditional  upon  the  low  wind 
speed  regime  for  the  interval  from  T  +  120  h  to  T  +  144  h 
(Figure  6b)  is  uniformly  larger  than  the  unconditioned  fore¬ 
cast  DSA,  as  expected.  Maximum  conditional  DSA  is  1.8°C 
or  more  in  places  in  the  Subtropics  and  lower  Midlatitudes, 
including  most  prominently  the  lower-latitude  portions  of  the 
eastern  North  Pacific  and  North  Atlantic  Oceans.  Values 
greater  than  1°C  are  common  elsewhere  in  the  NH. 


Figure  6.  (a)  Composite  forecast  DSA  (°C)  for  the  T  +  120  h  to  T  +  144  h  interval  from  the  SSTidf  ensemble,  uncondi¬ 
tional  upon  wind  speed  regime,  (b)  As  for  Figure  6a  but  conditional  upon  the  0-3  ms-1  wind  speed  regime.  Black  shading 
denotes  regions  with  no  realizations  of  forecast  DSA  for  this  wind  regime,  (c)  Difference  between  the  composite  forecast 
DSA  for  the  T  +  108  h  to  T  +  132  h  forecast  interval  and  the  composite  observed  DSA  for  the  low  wind  speed  regime 
(0-3  ms-1).  Black  shading  denotes  regions  with  no  realizations  of  forecast  DSA  and/or  no  observations  of  DSA  for  this  wind 
regime,  (d)  As  for  Figure  6c  but  for  the  high  wind  speed  regime  (>5  ms-1). 
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[41]  To  validate  the  forecast  DSA,  observations  of  DSA 
were  derived  from  GOES-East  and  GOES-West  raw  satellite 
observations  of  SST  and  composited  conditional  upon  ana¬ 
lyzed  wind  speed  regime.  The  GOES  raw  satellite  observa¬ 
tions  of  SST  were  obtained  from  the  NCODA  data 
assimilation  system  for  each  date  in  the  test  period.  Since  the 
footprint  of  the  GOES  observations  encompasses  the  region 
roughly  from  180°W  to  20°W,  the  calculation  of  composite 
observed  DSA  was  restricted  to  this  region.  Calculation  of 
the  composite  observed  DSA  proceeds  as  follows.  For  a 
given  date,  all  the  satellite  SST  observations  within  a  24-h 
interval  ending  at  12  UTC  of  that  date  are  interpolated  to  a 
1°  x  1°  latitude-longitude  grid  using  the  nearest-neighbor 
method.  Next,  a  wind  speed  regime  and  a  particular  grid 
point  are  chosen,  and  a  check  is  made  to  ensure  that  the 
average  over  the  24-h  interval  of  the  analyzed  10-m  wind 
speed  at  the  grid  point  meets  the  specified  wind  speed  cri¬ 
terion.  If  the  wind  speed  meets  the  criterion,  the  DSA  is 
calculated  from  the  interpolated  satellite  SST  observations. 
This  process  is  repeated  for  all  dates,  and  the  resulting  col¬ 
lection  of  DSA  observations  are  averaged  to  produce  the 
composite  at  the  chosen  grid  point. 

[42]  Figures  6c  and  6d  show  the  difference  between  the 
composite  forecast  DSA  for  the  T  +  108  h  to  T  +  132  h 
forecast  interval  and  the  composite  observed  DSA  for  the 
low  wind  speed  and  high  wind  speed  regimes,  respectively. 
These  difference  fields  are  equivalent  to  the  average  over  the 
test  period  of  the  error  in  the  ensemble-mean  forecast  DSA 
for  the  given  wind  speed  regime.  From  Figure  6c,  it  is 
apparent  that  in  the  low  wind  speed  regime  the  forecast  DSA 
is  too  large  as  compared  to  observations  in  most  parts  of  the 
NFI  extratropical  Pacific  and  Atlantic  Oceans.  The  only 
notable  exception  is  in  the  gulf  stream  current  off  the 
northeast  coast  of  the  United  States.  The  most  prominent 
excesses  of  the  NH  extratropical  forecast  DSA  are  found  in 
the  Atlantic  Ocean,  where  difference  values  of  0.5-1 .0°C  are 
common.  It  is  not  clear  what  the  cause  of  these  excesses  is, 
although  one  can  speculate  that  it  is  probably  due  to  exces¬ 
sive  solar  insolation  arising  from  insufficient  clouds,  or  to 
insufficient  wind-driven  mixing  of  the  ocean  surface. 

[43]  For  the  Tropics  in  Figure  6c,  the  forecast  DSA  also 
tends  to  be  too  large,  particularly  near  the  West  African  coast. 
However,  in  the  equatorial  East  Pacific  Ocean  the  forecast 
DSA  is  too  small.  For  the  SH,  the  forecast  DSA  tends  to  be 
too  large  in  the  Subtropics  and  too  small  in  the  Midlatitudes. 

[44]  Considering  the  DSA  differences  for  the  high  wind 
speed  regime  (Figure  6d),  it  is  apparent  that  the  forecast 
DSA  is  too  small  over  the  entire  compositing  region.  The 
most  notable  differences  are  found  in  the  gulf  stream  current, 
in  the  eastern  Pacific  Ocean  along  the  southwest  coast  of 
North  America,  and  along  the  Equator  in  the  Pacific  Ocean. 

[45]  Validation  of  the  forecast  DSA  was  also  indepen¬ 
dently  performed  against  observations  of  DSA  derived  from 
Global  Drifter  Program  drifting-buoy  observations,  with 


results  similar  to  those  above  (not  shown).  Thus,  the  above 
results  point  to  the  need  for  future  work  to  fine-tune  the 
forecast  DSA.  They  also  raise  the  possibility  that  such  fine- 
tuning  could  yield  forecast  performance  gains  beyond  those 
discussed  in  section  3.2.2. 

3.2.  Diagnostics  of  Ensemble  Properties 

3.2.1.  Ensemble  SST  Variance 

[46]  The  sample  variance  of  the  ensemble  SST  is  important 
as  a  measure  of  the  size  and  spatial  structure  of  the  uncer¬ 
tainty  in  the  SST  forecast.  This  section  examines  the  nature 
of  the  ensemble  SST  variance  both  at  the  forecast  initial  time 
and  after  the  SST  has  been  evolved  by  the  NWP  model. 

3.2.1. 1.  Initial-Time  Ensemble  SST  Variance 

[47]  The  T  +  000  h  ensemble  SST  variance  is  theoretically 
constrained  by  the  NCODA  SST  analysis  error  variance,  as 
per  equation  (1).  Figures  7a  and  7b  respectively  show  the 
T  +  000  h  ensemble  SST  variance  and  the  NCODA  SST 
analysis  error  variance  for  00  UTC  31  May  2007.  From 
Figure  7a  it’s  apparent  that  the  ensemble  has  substantial 
initial-time  SST  variance  throughout  the  Tropics  and  Sub¬ 
tropics,  as  well  as  certain  parts  of  the  NH  Midlatitudes. 
Variance  is  largest  in  the  West  Pacific  warm  pool  region 
near  the  Philippines  and  in  the  waters  surrounding  Central 
America.  Comparing  the  ensemble  SST  analysis  perturba¬ 
tion  variance  (Figure  7a)  with  the  NCODA  SST  analysis 
error  variance  (Figure  7b),  the  most  obvious  relationship  is 
that  the  ensemble  variance  is  too  large  in  most  areas  of  the 
Tropics.  On  the  other  hand,  the  peak  values  of  ensemble 
variance  in  the  NH  Midlatitudes  are  the  same  as  those  for  the 
NCODA  error  variance.  The  overly  large  tropical  T  +  000  h 
ensemble  variance  is  a  known  property  of  the  local  ET  [see 
McLay  et  al.,  2010,  Figure  5c]  and  follows  on  theoretical 
grounds  from  the  large  difference  between  the  ensemble  size 
K  and  the  rank  of  the  matrices  P„.  This  excess  variance  can 
be  mitigated  in  the  local  ET  method  by  tuning  (i.e.,  down- 
scaling)  the  relevant  elements  of  P„,  but  the  series  of 
experiments  required  for  such  tuning  were  considered 
beyond  the  scope  of  this  study.  It  is  worth  noting  that  such 
tuning  may  be  a  simple  way  to  obtain  performance  gains 
over  and  above  those  discussed  in  this  study. 

[48]  Further  illustration  of  the  initial-time  ensemble  SST  is 
provided  by  Figure  7c,  which  shows  the  SST  analysis  per¬ 
turbation  of  member  12  of  the  ensemble  initialized  00  UTC 
31  May  2007.  Most  evident  in  Figure  7c  is  the  abundant 
smaller-scale  structure  of  the  analysis  perturbation.  Also 
notable  is  the  long  perturbation  feature  that  stretches  south- 
westward  from  Canada  to  the  Central  Pacific  Ocean.  This 
perturbation  feature  marks  a  region  of  relatively  low  10-m 
wind  speed  on  the  equatorward  edge  of  the  midlatitude 
atmospheric  frontal  zone  (not  shown),  and  is  also  coincident 
with  relatively  large  NCODA  analysis  error  variance 
(Figure  7b).  The  feature  highlights  the  fact  that  the  SST 


Figure  7.  (a)  Ensemble  SST  analysis  perturbation  variance  ([°C]2)  for  the  00  UTC  3 1  May  2007  initialization  of  the  SSTidf 
ensemble,  (b)  NCODA  SST  analysis  error  variance  ([°C]2)  for  00  UTC  31  May  2007.  (c)  SST  analysis  perturbation  (°C)  of 
member  12  of  the  SSTidf  ensemble  initialized  00  UTC  31  May  2007. 
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perturbations  from  the  ET  method  have  structural  relation¬ 
ships  with  the  atmospheric  flow. 

3.2. 1.2.  Evolved  Ensemble  SST  Variance 

[49]  When  a  nonlinear,  chaotic  system  is  evolved  from 
imperfect  initial  conditions,  the  forecast  uncertainty  as  mea¬ 
sured  by  the  variance  is  expected  to  display  bounded  growth 
over  some  sufficiently  long  evolution  period.  The  mechan¬ 
isms  of  ensemble  SST  variance  growth  for  the  present  system 
are  quite  limited,  deriving  only  from  the  interaction  between 
winds,  surface  fluxes,  radiative  fluxes,  and  SST  within  the 
model  of  SST  diurnal  variation.  Nevertheless,  some  ensem¬ 
ble  SST  variance  growth  still  arises,  as  Figure  8a  helps  to 
illustrate.  Figure  8a  specifically  shows  the  amplification 
factor  of  the  ensemble  SST  variance  over  the  interval 
T  +  000  h  to  T  +  240  h,  averaged  over  all  00  UTC  forecast- 
ensemble  initializations.  The  amplification  factor  of  the 
ensemble  SST  variance  is  defined  as  the  ratio  between  the 
variance  at  T  +  240  h  and  the  variance  at  T  +  000  h.  One  notes 
some  variance  growth  across  most  of  the  ocean  basins, 
although  the  most  prominent  growth  is  clearly  confined  to  the 
central  and  eastern  North  Pacific  Ocean.  The  growth  that 
occurs  in  the  Tropics  is  weak  and  is  generally  described  by  an 
amplification  factor  of  1  to  1.5.  Additionally,  there  is  weak 
decay  of  variance  (proportionally  less  than  10%)  in  the 
tropical  Pacific  Ocean  in  the  immediate  vicinity  of  the 
Equator,  as  well  as  in  the  West  Pacific  warm  pool  region.  For 
comparison,  Figure  8b  presents  the  amplification  factor  of 
the  SST  forecast  error  variance  over  the  interval  T  +  000  h  to 
T  +  240  h,  where  the  amplification  factor  of  the  SST  forecast 
error  variance  is  defined  as  the  ratio  between  the  SST  forecast 
error  variance  at  T  +  240  h  and  the  average  NCODA  SST 
analysis  error  variance.  The  SST  forecast  error  variance  is 
estimated  as  the  variance  of  a  sample  of  errors  in  the  ensemble- 
mean  SST  forecast.  The  sample  of  errors  is  composed  by  first 
calculating  the  forecast  error  for  each  00  UTC  initialization  of 
the  forecast  ensemble  in  the  period  01  May  2007  to  30  June 
2007  and  then  pooling  together  the  forecast  errors  from  all 
initializations.  Comparing  Figures  8a  and  8b  one  finds  that  the 
ensemble’s  amplification  factor  approaches  that  of  the  forecast 
error  in  the  central  and  eastern  North  Pacific  Ocean,  but  that 
elsewhere  the  ensemble’s  amplification  factor  is  considerably 
smaller.  The  difference  between  the  two  is  particularly  pro¬ 
nounced  throughout  most  of  the  Tropics.  Some  large  differ¬ 
ences  are  to  be  expected  given  the  limited  mechanisms  of 
ensemble  SST  variance  growth  for  the  present  system.  How¬ 
ever,  it’s  also  possible  that  NCODA  underestimates  the  SST 
analysis  error  variance  in  certain  regions,  thereby  inflating  the 
forecast  error  amplification  factor. 

[50]  Figures  8c  and  8d  show  the  evolved,  T  +  240  h 
ensemble  SST  variance  and  the  T  +  240  h  SST  forecast  error 
variance,  respectively.  The  ensemble  SST  variance  is  aver¬ 
aged  over  all  00  UTC  forecast-ensemble  initializations  in  the 
period  1  May  2007-30  June  2007.  The  SST  forecast  error 


variance  is  calculated  as  described  above  for  Figure  8b. 
Figure  8c  shows  that  the  ensemble  maintains  SST  variance  in 
all  of  the  tropical  oceans  as  well  as  throughout  the  North 
Pacific  Ocean,  the  Gulf  of  Mexico,  and  the  southern  and 
western  parts  of  the  North  Atlantic  Ocean.  The  most  pro¬ 
nounced  variance  is  in  the  West  Pacific  warm  pool  region 
and  near  the  west  coast  of  Mexico  and  Central  America. 
Comparing  Figures  8c  and  8d,  one  sees,  not  surprisingly,  that 
the  ensemble  variance  is  much  smaller  than  the  forecast  error 
variance  over  much  of  the  North  Pacific  Ocean,  in  the  west¬ 
ern  Indian  Ocean,  in  the  vicinity  of  the  Gulf  Stream  current  in 
the  North  Atlantic  Ocean,  and  over  the  Equatorial  East 
Pacific  Ocean.  On  the  other  hand,  the  ensemble  SST  variance 
is  somewhat  excessive  in  the  West  Pacific  warm  pool  region 
and  in  the  East  Pacific  Ocean  near  Central  America. 

3.2.2.  Ensemble  Verification  Metrics 

[51]  Figures  9a-9c  summarize  the  performance  of  the 
SSTidf  forecast  ensemble  in  relation  to  the  control  ensemble 
at  T  +  120  h,  T  +  240  h  and  T  +  336  h,  respectively,  in  terms  of 
four  metrics  including  (1)  the  bias  of  the  ensemble-mean 
forecast,  (2)  the  root-mean  square  (RMS)  error  of  the  ensemble- 
mean  forecast  with  bias  removed  prior  to  calculation,  (3)  a 
variance  metric  that  measures  the  absolute  difference  between 
the  forecast  ensemble  variance  and  an  estimate  of  the  observed 
forecast  error  variance,  and  (4)  the  Continuous  Ranked 
Probability  Score  (CRPS).  For  the  variance  metric,  the 
observed  forecast  error  variance  is  estimated  as  the  variance  of 
a  sample  of  errors  in  the  ensemble-mean  forecast.  The  sample 
of  errors  is  composed  by  first  calculating  the  forecast  error  for 
each  00  UTC  initialization  of  the  forecast  ensemble  in  the 
period  1  May  2007  to  30  June  2007  and  then  pooling  together 
the  forecast  errors  from  all  initializations.  The  CRPS  metric  is 
a  measure  of  the  integrated  difference  between  the  cumulative 
distribution  function  (CDF)  of  the  forecast  ensemble  and  the 
CDF  of  the  verifying  analysis  [Wilks,  2006].  The  above  four 
metrics  were  calculated  for  five  different  variables  [surface 
temperature  (which  includes  both  land-surface  temperature 
and  SST),  2-m  air  temperature,  500  hPa  geopotential  height, 
10-m  wind  speed,  and  24-h  accumulated  precipitation]  and 
thirteen  different  regions.  For  a  given  metric,  variable,  and 
region,  the  calculations  were  carried  out  for  each  00  UTC 
forecast  ensemble  initialization  in  the  period  1  May  2007  to 
30  June  2007.  Then,  the  difference  between  the  score  for 
SSTidf  and  the  corresponding  score  for  the  control  was  cal¬ 
culated  for  each  initialization.  Finally,  the  differences  in  score 
for  all  initializations  were  averaged.  A  statistical  significance 
test  of  the  average  difference  in  score  between  the  SSTidf 
forecast  ensemble  and  the  control  ensemble  for  a  given  metric, 
variable,  and  region  was  carried  out  using  a  method  that 
accounts  for  serial  dependence  of  data  [Wilks,  2006,  pp.  143- 
145].  Cases  where  SSTidf  scores  significantly  better  (worse) 
than  the  control  are  colored  green  (red),  and  cases  where  no 
significant  difference  in  score  is  found  are  colored  light  grey. 


Figure  8.  (a)  Average  amplification  factor  of  the  ensemble  SST  variance  over  the  interval  T  +  000  h  to  T  +  240  h. 
(b)  Amplification  factor  of  the  SST  forecast  error  variance  over  the  interval  T  +  000  h  to  T  +  240  h.  (c)  Average  ensemble 
SST  variance  ([°C]2)  at  T  +  240  h  for  ensemble  SSTidf.  (d)  SST  forecast  error  variance  ([°C]2)  at  T  +  240  h  for  ensemble 
SSTidf. 
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In  the  case  of  24-h  accumulated  precipitation,  cross-hatching 
denotes  regions  for  which  the  verification  metrics  were  not 
calculated  because  the  TRMM  data  set  is  only  available  for  the 
50°N  latitude  band. 

[52]  Considering  Figures  9a-9c  together,  it  is  clear  that 
SSTidf  scores  significantly  better  than  the  control  in  the 
overwhelming  majority  of  cases  for  the  tropical  regions. 
Particularly  notable  is  the  consistent  improvement  gained  by 
SSTidf  for  tropical  regions  in  terms  of  the  CRPS  and  vari¬ 
ance  metrics.  SSTidf  also  gains  a  considerable  advantage  for 
tropical  regions  in  terms  of  RMSE  for  the  variables  1 0-m  wind 
speed  and  24-h  accumulated  precipitation.  The  improvement 
gained  by  SSTidf  is  not  only  positive  and  statistically  signifi¬ 
cant  in  a  large  number  of  cases,  it  is  often  nontrivial  in  terms  of 
magnitude.  This  is  illustrated  by  Figure  10,  which  compares 
the  SSTidf  and  control  ensembles  in  terms  of  the  forecast 
variance  of  24  h  accumulated  precipitation  in  the  Tropics 
(Figure  10a)  and  the  CRPS  of  10-m  wind  speed  in  the  tropical 
Indian  Ocean  (Figure  10b).  The  SSTidf  ensemble  yields  gains 
equivalent  to  several  days  or  more  of  forecast  skill  at  longer 
lead  times  in  both  Figures  10a  and  10b.  For  example,  in 
Figure  10b  the  CRPS  of  SSTidf  at  T  +  240  h  is  roughly 
equivalent  to  that  of  the  control  at  T  +  168  h. 

[53]  Considering  again  Figures  9a-9c,  one  finds  that 
SSTidf  has  much  less  of  an  impact  in  the  Midlatitudes,  and 
particularly  very  little  impact  on  midlatitude  500  hPa  geo¬ 
potential  height.  This  result  is  similar  to  that  of  Takaya  et  al. 
[2010a],  Nonetheless,  in  those  cases  where  SSTidf  does 
have  an  impact  in  the  Midlatitudes,  the  cases  with  positive 
impacts  outnumber  the  cases  with  negative  impacts.  The 
overall  least  impact  is  to  the  SH  Midlatitudes,  which  is  to  be 
expected  given  the  high  wind  speeds  over  the  southern 
oceans.  There  are  only  a  few  cases  where  SSTidf  scores 
persistently  worse  than  the  control,  these  being  namely  the 
bias  of  surface  temperature  in  the  southern  Pacific  Ocean, 
the  North  Atlantic  Ocean,  the  tropical  eastern  Pacific  Ocean, 
and  tropical  Atlantic  Ocean,  and  the  RMSE  of  the  surface 
temperature  in  the  tropical  Atlantic  Ocean.  At  T  +  120  h 
(Figure  9a),  SSTidf  exhibits  worse  RMSE  of  surface  tem¬ 
perature  in  the  tropical  latitudes  closest  to  the  Equator,  but 
there  is  no  significant  effect  at  the  later  lead  times.  SSTidf 
also  exhibits  worse  RMSE  and  CRPS  for  500  hPa  geopo¬ 
tential  height  over  the  North  Atlantic  Ocean  at  the  interme¬ 
diate  lead  time  of  T  +  240  h  (Figure  9b),  but  there  is  no 
similar  indication  at  the  earlier  lead  time  of  T  +  120  h 


(Figure  9a)  nor  at  the  later  time  of  T  +  336  h  (Figure  9c). 
Some  of  these  cases  might  be  remedied  through  the  sys¬ 
tematic  retuning  of  other  NOGAPS  physical  parameteriza- 
tions  such  as  the  boundary  layer  and  surface-flux  schemes, 
something  that  was  not  attempted  for  this  study.  Also,  the 
overall  favorable  results  of  Figure  9  suggest  that  the  statis¬ 
tically  significant  changes  to  ensemble-mean  physical 
properties  that  were  discussed  in  section  3.1.1  are  in  fact  of 
practical  significance  as  well. 

[54]  The  ensemble  diagnostics  were  further  stratified  by 
verifying  the  SSTidf  and  control  forecast  ensembles  only  at 
ocean  grid  points  (Figure  11a)  and  only  at  land  grid  points 
(Figure  1  lb).  For  brevity  only  the  results  for  T  +  240  h  are 
shown.  Also,  the  land-point  diagnostics  were  restricted  to  just 
five  of  the  13  regions,  because  the  other  seven  regions 
encompassed  very  few  land  grid  points.  The  ocean-grid  point 
results  are  largely  similar  to  the  full-grid  results  presented  in 
Figure  9.  They  show  the  general  superiority  of  SSTidf  in 
tropical  regions,  and  the  lesser  impact  of  SSTidf  in  the  Mid¬ 
latitudes.  Flowever,  they  reveal  that  SSTidf  degrades  the  SST 
bias  in  the  Tropics,  mainly  due  to  the  forecasts  for  the  tropical 
eastern  Pacific  Ocean.  The  land-grid  point  results  (Figure  1  lb) 
also  are  overall  favorable  and  much  the  same  as  the  full-grid 
results  in  Figure  9.  One  new  detail  that  emerges  is  that  SSTidf 
degrades  the  over-land  tropical  2-m  air  temperature  bias  and 
hence  the  over-land  tropical  2-m  air  temperature  CRPS. 
Nonetheless,  the  results  also  show  that  SSTidf  is  not  signifi¬ 
cantly  worse  in  terms  of  any  metric  for  the  over-land  NH 
Midlatitudes,  in  contrast  to  the  over-ocean  case. 

[55]  The  test  period  of  the  study  (1  May  2007  to  30  June 
2007)  encompassed  one  complete  cycle  of  the  Madden- 
Julian  Oscillation  (MJO).  As  a  final  diagnostic,  this  episode 
of  the  MJO  was  evaluated  using  the  Wheeler  and  Hendon 
[2004]  Real  Time  Multivariate  MJO  index  (RMM).  The 
RMM  index  provides  information  on  the  geographic  loca¬ 
tion  and  strength  of  the  MJO  signal.  No  meaningful  differ¬ 
ence  was  found  between  the  RMM  index  from  the  SSTidf 
ensemble  and  that  from  the  control  ensemble  in  terms  of 
either  phase  or  amplitude  of  the  MJO  signal  (not  shown). 

3.3.  Role  of  SST  Initial  Perturbations 

[56]  All  the  results  described  thus  far  pertain  to  forecast 
SSTidf,  where  the  forecast  ensemble  is  generated  using  the 
diurnal  SST  model  in  conjunction  with  SST  analysis  per¬ 
turbations  produced  by  the  local  ET.  Still  to  be  determined  is 


Figure  9.  (a)  Performance  of  the  SSTidf  ensemble  relative  to  the  control  ensemble  at  T  +  120  h.  Varying  across  the  left  side 
of  the  figure  are  the  metrics  [BIAS,  RMSE,  VAR,  CRPS]  and  the  variables  [surface  temperature  (sfctmp)  (°C),  2-m  air  tem¬ 
perature  (airtmp2m)  (°C),  500  hPa  geopotential  height  (geopht500)  (m),  10-m  wind  speed  (10  m Wjnd)  (ms-1),  24-h  accumu¬ 
lated  precipitation  (24  h  precip)  (mm)].  Varying  across  the  top  of  the  figure  are  the  regions  [NM  (NH  Midlatitudes,  30°N- 
65°N,  0°E-360°E),  SM  (SH  Midlatitudes,  30°S-65°S,  0°E-360°E),  SI  (southern  Indian  Ocean,  60°S-30°S,  0°E-120°E), 
SPAC  (South  Pacific  Ocean,  60°S-30°S,  180°W-90°W),  NPAC  (North  Pacific  Ocean,  30°N-60°N,  130°E-120°W),  NATL 
(North  Atlantic  Ocean,  30°N-70°N,  70°W-0°E),  T3  (tropical  band  3,  30°S-30°N,  0°E-360°E),  T2  (tropical  band  2,  20°S- 
20°N,  0°E-360°E),  T1  (tropical  band  1,  10°S-10°N,  0°E-360°E),  TI  (tropical  Indian  Ocean,  20°S-20°N,  40°E-110°E), 
TWP  (tropical  western  Pacific  Ocean,  20°S-20°N,  1 10°E-160°W),  TEP  (tropical  eastern  Pacific  Ocean,  20°S-20°N, 
160°W-80°W),  TATL  (tropical  Atlantic  Ocean,  20°S-20°N,  70°W-0°E)].  Cases  where  SSTidf  scores  significantly  better 
(worse)  than  the  control  are  colored  green  (red),  and  cases  where  no  significant  difference  in  score  is  found  are  colored  light 
grey.  In  the  case  of  24-h  accumulated  precipitation,  cross-hatching  denotes  regions  for  which  the  verification  metrics  were 
not  calculated.  See  text  for  further  details,  (b)  As  for  Figure  9a,  but  at  T  +  240  h.  (c)  As  for  Figure  9a,  but  at  T  +  336  h. 
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Figure  9.  (continued) 


the  effect  of  the  diurnal  SST  model  in  isolation  from  the 
SST  analysis  perturbations  and,  vice  versa,  the  effect  of  the 
SST  analysis  perturbations  in  isolation  from  the  diurnal  SST 
model.  Here,  such  effects  are  examined  through  comparison 
of  the  three  forecasts  SSTdf,  SSTi,  and  SSTidf. 

[57]  Figure  12a  summarizes  the  performance  of  the  SSTdf 
forecast  ensemble  in  relation  to  the  control  ensemble  at 
T  +  240  h  for  ocean  grid  points.  Overall,  one  can  infer  that 
the  diurnal  forcing  is  beneficial  to  the  forecast,  since  the 
cases  where  SSTdf  is  improved  relative  to  the  control  out¬ 
number  those  where  it  is  degraded.  However,  the  number  of 
cases  with  improvement  is  not  overwhelming,  in  contrast  to 
the  results  for  SSTidf  (Figure  1  la). 

[58]  Figure  12b  summarizes  the  performance  of  the  SSTi 
forecast  ensemble  in  relation  to  the  control  ensemble  at 
T  +  240  h  for  ocean  grid  points.  Considering  the  results  for 
the  tropical  regions,  it’s  apparent  that  the  SST  analysis  per¬ 
turbations  are  beneficial  to  the  forecast;  SSTi  wins  in  the 
vast  majority  of  cases,  and  particularly  for  10-m  wind  speed 
and  24-h  accumulated  precipitation.  On  the  other  hand,  SSTi 
engenders  some  adverse  effects  versus  the  control  in  the  NH 
Midlatitudes. 

[59]  Finally,  a  comparison  is  made  between  the  forecasts 
from  SSTidf  and  SSTdf  in  Figure  13.  From  this  figure,  it  is 
clear  that  SSTidf  is  superior  to  SSTdf  for  the  tropical  regions. 
Thus,  inclusion  of  the  SST  analysis  perturbations  provides  a 
clear  benefit  to  forecasts  for  the  Tropics  over  and  above  that 
of  diurnal  SST  variation  alone.  Inclusion  of  the  SST  analysis 
perturbations  is  particularly  beneficial  to  tropical  10-m  wind 


speed  and  tropical  24-h  accumulated  precipitation.  Consid¬ 
ering  the  midlatitude  regions  in  Figure  13,  one  finds  that 
inclusion  of  the  SST  analysis  perturbations  provides  con¬ 
siderably  fewer  gains.  The  few  cases  where  SSTidf  performs 
significantly  worse  are  associated  with  the  North  Atlantic 
region  and  with  the  bias  of  SST  in  the  Tropics. 

4.  Conclusions 

[60]  A  modified  version  of  the  Takaya  et  al.  [2010b] 
prognostic  diurnal  SST  scheme  is  incorporated  into  the 
NOGAPS  global  spectral  model.  Additionally,  the  local-ET 
analysis  perturbation  scheme  is  adapted  to  generate  analysis 
perturbations  to  both  atmospheric  variables  and  SST.  The 
NOGAPS  model  with  diurnal  SST  and  the  ET  analysis 
perturbations  are  used  in  conjunction  to  generate  336  h  lead 
time,  32-member  forecast  ensembles  for  each  00  UTC 
analysis  time  in  the  May-June  2007  period.  The  resulting 
forecast  ensembles  are  diagnosed  both  in  terms  of  various 
physical  properties  and  in  terms  of  forecast  performance. 
Physical  properties  that  are  examined  include  the  nature  of 
surface  fluxes,  OLR,  and  near-surface  atmospheric  vari¬ 
ables,  the  diurnal  cycle  of  precipitation,  and  the  DSA.  Per¬ 
formance  metrics  that  are  examined  include  bias  and  RMS 
error  of  the  ensemble  mean,  ensemble  variance,  and  CRPS. 

[61]  The  SST  variation  is  found  to  introduce  statistically 
significant  changes  to  the  ensemble-mean  state  of  all  of  the 
physical  variables  that  are  analyzed.  The  changes  to  the 
ensemble-mean  state  likely  are  introduced  by  two  principal 
sources,  one  being  the  nonlinear  evolution  of  the  ensemble 
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Figure  10.  (a)  Average  variance  of  24-h  accumulated  pre¬ 
cipitation  for  the  SSTidf  ensemble  (solid  red  line)  and  the 
control  ensemble  (solid  black  line)  for  the  Tropics  [0°E  to 
360°E,  20°S  to  20°N],  Observed  forecast  error  variance  for 
the  SSTidf  (control)  ensemble  given  by  the  dashed  red 
(black)  lines.  Filled  black  (grey)  circles  indicate  SSTidf  pro¬ 
vides  statistically  significant  (insignificant)  improvement  rel¬ 
ative  to  the  control  at  the  corresponding  lead  times,  (b)  CRPS 
of  10-m  wind  speed  for  the  SSTidf  ensemble  (red  line)  and 
control  ensemble  (black  line)  for  the  tropical  Indian  Ocean 
[40°E  to  1 10°E,  20° S  to  20°N],  Meaning  of  filled  circles  as 
for  Figure  10a. 

perturbations  by  the  N  WP  model  and  the  other  being  a  slight 
increase  in  the  daily  average  SST  owing  to  the  chosen  con¬ 
figuration  of  the  model  of  SST  diurnal  variation.  The  chan¬ 
ges  are  most  prevalent  in  the  tropical  regions,  but  in  some 
cases  extend  to  the  higher  latitudes  of  the  NH.  Also,  the  SST 
variation  leads  to  a  midday  peak  in  the  diurnal  cycle  of 
precipitation  in  low-wind  regimes  in  both  the  Indian  Ocean 
and  western  Pacific  Ocean.  This  midday  peak  is  more  con¬ 
sistent  with  observations  and  is  not  found  in  a  control 
ensemble  that  lacks  all  SST  variation  (the  control  was  gen¬ 
erated  with  a  prescribed  SST  field  that  was  held  constant 
throughout  integration).  Composites  of  the  forecast  DSA 
show  a  direct  relationship  with  wind  speed,  as  expected,  but 
the  forecast  DSA  is  often  too  large  (small)  in  regimes  of  low 
(high)  wind  speed. 

[62]  Very  favorable  results  are  gained  from  the  SST  vari¬ 
ation  in  terms  of  ensemble  performance  metrics.  The  fore¬ 
casts  with  SST  variation  maintain  appreciable  ensemble  SST 


variance  in  both  the  Tropics  and  the  Midlatitudes  throughout 
the  336  h  forecast  interval.  More  notably,  these  forecasts  are 
overall  superior  in  the  tropical  regions  when  compared  to 
forecasts  from  the  control  ensemble.  Statistically  significant 
gains  in  forecast  performance  are  seen  for  all  the  variables 
that  are  analyzed,  including  surface  temperature  (both  land- 
surface  and  SST),  2-m  air  temperature,  500  hPa  geopotential 
height,  10-m  wind  speed,  and  24-h  accumulated  precipita¬ 
tion.  Furthermore,  significant  gains  are  seen  in  all  the 
ensemble  metrics,  but  particularly  in  the  variance  metric  and 
the  probabilistic  CRPS  metric.  The  broad-based  gains  dem¬ 
onstrate  that  the  SST  variation  is  important  to  multiple 
aspects  of  the  forecast  probability  distribution,  including 
both  the  mean  and  the  variance.  At  longer  lead  times  the 
magnitude  of  the  gains  can  be  meaningful,  in  some  cases 
exceeding  several  days’  worth  of  forecast  skill.  Stratification 
of  the  diagnostics  using  only  land  grid  points  or  only  ocean 
grid  points  leads  to  similar,  favorable  results.  Inclusion  of 
the  SST  variation  has  much  less  of  an  impact  in  the  Mid¬ 
latitudes,  but  even  here  the  SST  variation  is  beneficial  on  the 
whole. 

[63]  A  further  comparison  addressed  the  individual  impact 
of  the  diurnal  SST  variation  and  the  SST  analysis  perturba¬ 
tions  on  the  performance  of  the  forecast  ensemble.  Findings 
indicate  that  the  diurnal  SST  variation  and  the  SST  analysis 
perturbations  each  are  overall  beneficial  to  the  forecast  per¬ 
formance  in  the  tropical  regions.  However,  the  performance 
gains  from  the  SST  analysis  perturbations  are  much  broader 
than  those  from  the  diurnal  SST  variation.  On  the  other 
hand,  the  SST  analysis  perturbations  lead  to  some  adverse 
effects  in  the  NH  midlatitude  regions  that  are  not  seen  when 
using  only  diurnal  SST  variation.  Finally,  it  is  shown  that  the 
forecasts  generated  using  both  the  diurnal  SST  variation  and 
the  SST  analysis  perturbations  are  superior  in  the  tropical 
regions  as  compared  to  forecasts  generated  using  SST  diur¬ 
nal  variation  alone.  Thus,  the  SST  analysis  perturbations  are 
essential  to  obtain  the  best  forecast  performance. 

[64]  Certain  aspects  of  the  forecast  ensemble  with  SST 
variation  stand  to  be  improved  through  future  work.  As 
underscored  in  section  2,  the  model  of  diurnal  SST  variation 
used  in  this  study  neglects  the  cool-skin  effect  of  the  Takaya 
et  al.  [2010b]  model.  This  effect  should  eventually  be 
incorporated  for  more  realism.  Also,  some  issues  arise  with 
the  SST  analysis  perturbations.  For  instance,  inclusion  of 
these  perturbations  causes  small  but  statistically  significant 
degradation  of  the  500  hPa  geopotential  height  over  the 
North  Atlantic  Ocean  at  lead  times  near  T  +  240  h.  Further, 
the  variance  of  the  SST  analysis  perturbations  is  too  large  in 
the  Tropics.  These  issues  might  be  addressed  through  mod¬ 
ifications  of  the  analysis  perturbation  structure  and  statistics. 
For  instance,  if  the  negative  impacts  over  the  North  Atlantic 
Ocean  are  not  due  to  teleconnections,  then  they  might  be 
mitigated  by  constraining  the  SST  variation  to  lie  within  the 
40  N  latitude  band  in  the  manner  of  Takaya  et  al.  [2010a]. 
Additional  problems,  such  as  a  degradation  of  over-land  2-m 
air  temperature  bias,  a  reduction  of  the  peak  early  morning 
precipitation  in  the  Indian  Ocean  and  South  China  Sea,  and 
the  too-large  forecast  DSA  in  low  wind  speed  conditions, 
also  need  to  be  addressed.  Some  of  these  problems  may  be 
mitigated  through  systematic  retuning  of  the  NOGAPS 
physical  parameterizations. 
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Figure  11.  (a)  Performance  of  the  SSTidf  ensemble  relative  to  the  control  ensemble  at  T  +  240  h  for 
ocean  grid  points  only.  Notation  as  for  Figure  9.  (b)  As  for  Figure  11a,  but  for  land  grid  points  only. 
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Figure  12.  (a)  Perfomiance  of  the  SSTdf  ensemble  relative  to  the  control  ensemble  at  T  +  240  h  for 
ocean  grid  points  only.  Cases  where  SSTdf  scores  significantly  better  (worse)  than  the  control  are  colored 
green  (red),  and  cases  where  no  significant  difference  in  score  is  found  are  colored  light  grey.  Notation  as 
for  Figure  9.  (b)  Performance  of  the  SSTi  ensemble  relative  to  the  control  ensemble  at  T  +  240  h  for  ocean 
grid  points  only.  Cases  where  SSTi  scores  significantly  better  (worse)  than  the  control  are  colored  green 
(red),  and  cases  where  no  significant  difference  in  score  is  found  are  colored  light  grey.  Notation  as  for 
Figure  9. 
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Figure  13.  Performance  of  the  SSTidf  ensemble  relative  to  the  SSTdf  ensemble  at  T  +  240  h  for  ocean 
grid  points  only.  Cases  where  SSTidf  scores  significantly  better  (worse)  than  SSTdf  are  colored  green 
(red),  and  cases  where  no  significant  difference  in  score  is  found  are  colored  light  grey.  Notation  as  for 
Figure  9. 
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